arXiv:1507.08688vl [math.PR] 30Jul2015 


Stein’s method for functions of 
multivariate normal random variables 

Robert E. Gaunt 

Department of Statistics 
University of Oxford 
1 South Parks Road 
Oxford 0X1 3TG 
United Kingdom 

Abstract: It is a well-known fact that if the random vector W con¬ 
verges in distribution to a multivariate normal random variable E 1 ^ 2 Z, 
then g( W) converges in distribution to g(E 1,/2 Z) if g is continuous. In 
this paper, we develop a general method for deriving bounds on the 
distributional distance between g( W) and g( E 1,/2 Z). To illustrate this 
method, we obtain several bounds for the case that the j-component of 
W is given by Wj = -j= ^ij , where the Xij are independent. In 

particular, provided g satisfies certain differentiability and growth rate 
conditions, we obtain an order bound, for smooth test func¬ 

tions, if the first p moments of the Xij agree with those of the normal 
distribution. If p is an even integer and g is an even function, this con¬ 
vergence rate can be improved further to order n~ p ^ 2 . We apply these 
general bounds to some examples concerning asymptotically chi-square, 
variance-gamma and chi distributed statistics. 
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1. Introduction 

In 1972, Stein [28] introduced a powerful method that allows one to bound 
the distance between the distributions of a random variable W and a stan¬ 
dard normal random variable Z with respect to a probability metric. The 
basic approach, as described in detail in [29], involves two steps. The first is 
to solve the so-called Stein equation 

f"(w) - wf'(w) = h(w)-Eh(Z), (1.1) 

where the test function h is real-valued. Bounds for the solution, /, and its 
derivatives are then established. In the second step, the expectation 

nf(w) - wf(w)] 

i 


( 1 . 2 ) 
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is bounded, typically through the use of coupling techniques, which, via 
(1.1), leads to a bound for the quantity of interest E h(W) — E h(Z). 

By recognising the left-hand side of the Stein equation (1.1), [2] and [15] 
extended Stein’s method for normal approximation to the multivariate nor¬ 
mal distribution. A generalisation of (1.1) to the multivariate normal distri¬ 
bution with mean 0 and covariance matrix E (see [14]) is given by 

V T SV/(w) - w T V/(w) = h( w) - Eh(E 1 / 2 Z), (1.3) 


where Z denotes a random vector having standard multivariate normal dis¬ 
tribution of dimension d. The solution of (1.3) is given by 

n oo _ 

/(w) = -/ [E/i(e -s w + y/l - e- 2s E 1/2 Z) - Eh(E 1 / 2 Z)] ds (1.4) 

Jo 

and, if h is n times differentiable, satisfies the bound (see [2] and [14]): 


d k f{ w) 

1 

< - 

d k h(w) 


“ k 

n U dw *i 


(1.5) 


where ||/|| := ||/||oo = su PzeR \f( x )\- If we a I so suppose E is positive definite, 
we can obtain a bound involving one fewer derivative of h (see [8]): 


a*7(w) 

nU dw » 


< 


gj) 

y/2 T(*±i) 


min {| r °w ( , ( E- 1/2) | 


^-^(w) 


k> 2, 

( 1 . 6 ) 


where [row*; (E” 1 / 2 )! is the Euclidean norm of the ii -th row of E -1 / 2 . Similar 
bounds for the derivatives of / as a /c-linear form can also be found in [8] 
and [22], It was shown by [4] that the solution of (1.1) satisfies the bound 


||/( fc )|| < 2||h/ fc - 2 )||, k> 3. (1.7) 


This bound has the attractive property of involving two fewer derivatives of 
the test function h than the solution /, although this improvement is not 
possible for multivariate case (see [25]). 

It is a well-known fact that if the random vector W converges in distri¬ 
bution to a multivariate normal random variable E 1//2 Z, then for any con¬ 
tinuous function g : —>• M, g( W) converges in distribution to g(E 1//2 Z). 

In this paper, we use Stein’s method to assess the rate of convergence of the 
random variable g( W) to its limiting distribution < 5 r(E 1 / 2 Z). We obtain gen¬ 
eral results, which hold for functions g that satisfy certain differentiability 
and growth rate conditions (which will be described shortly). 
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Many standard probability distributions arise naturally as functions of 
multivariate normal random variables, such as the chi-square (xfd) = + 

• • • + Zj), chi, log-normal and t-distribution; for further examples see [30]. 
Moreover, many widely used statistics arise as functions of asymptotically 
multivariate normally distributed random variables, such as Pearson’s statis¬ 
tic, Friedman’s statistic and the popular D 2 , -Df an d ^2 statistics from 
alignment-free sequence comparison (see [19] and [27]). Also, limiting dis¬ 
tributions involving functions of multivariate normal random variables have 
recently occurred in the context of Malliavin calculus, such as variance- 
gamma [6] and linear combinations of chi-square random variables [1]. 

One of the strengths of Stein’s method is that it is readily adapted to 
other distributions; for a comprehensive overview see [18]. In particular, the 
method has been extended to many distributions that occur as functions 
of multivariate normal random variables, such as the chi-square [11], [20], 
chi [24], half-normal [5], variance-gamma [7] and products of normal and 
chi-square random variables [9], [10]. 

In adapting Stein’s method to these distributions, a Stein equation and 
bounds on the derivatives of its solution needed to be derived. For certain 
distributions this can be difficult; for example, the non-symmetric variance- 
gamma distribution [7] and product normal distribution [9] for which only 
limited progress has been made towards obtaining bounds for the deriva¬ 
tives of the solution. The approach described in this paper (sketched out 
shortly), which involves considering the multivariate normal Stein equation 
rather than the distribution’s specific Stein equation removes this difficulty 
by treating all distributions that arise as functions of multivariate normal 
random variables in one general framework. 

Moreover, in certain situations it may be more natural to frame the solu¬ 
tion of a problem in terms of the multivariate normal Stein equation than 
the Stein equation for the limiting distribution. Recently, [11] used such an 
approach to obtains bounds on the rate of convergence of Pearson’s statistic 
to its limiting chi-square distribution. Indeed, the techniques developed in 
this paper can be readily applied to assess distributional approximation for 
many important statistics. 

To obtain distributional approximations for statistics that are asymptot¬ 
ically distributed as a function of a multivariate normal, we simply consider 
the multivariate normal Stein equation (1.3) with test function h(g( w)): 

V r EV/(w) - w T V/(w) = h(g( w)) - E%(£ 1/2 Z)). 


( 1 . 8 ) 
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One can then bound the expectation 

E[V t £V/(W) - W T V/(W)] (1.9) 

using the various coupling techniques developed for multivariate normal ap¬ 
proximation (see [13], [14], [26] and [22]). However, in general the derivatives 
of the test function h(g( w)) will be unbounded (for the distribution, 
g(w) = w 2 and g'(w) = 2w) and therefore the derivatives of the solution 

roo 

/(w) = - / [E%(e~ s w + \/l — e _2s E 1/2 Z)) - E%(E 1/2 Z))] d s (1.10) 

Jo 

will also in general be unbounded. Therefore one cannot apply inequalities 
(1.5), (1.6) and (1.7) to bound the solution’s derivatives. 

In Section 2, we obtain general bounds, which apply for all g that sat¬ 
isfy certain differentiability and growth rate conditions, for the solution 
(1.10). As special cases of these general bounds, we obtain bounds for 
the case that the lower order partial derivatives of g have a polynomial 
(A + B Ylk=i \ w kY k i where the r *. are non-negative) or exponential growth 
rate (Aexp(t Ylt-i |^fc| c ), where 0 < c < 2). Our bounds are in general 
unbounded as |w| —» oo, which means that more care is needed in bounding 
the quantity (1.9) than in the usual multivariate normal setting in which 
the uniform bounds (1.5), (1.6) and (1.7) can be applied. 

Since our bounds for the partial derivatives of the solution of the Stein 
equation involve derivatives of the test function h, the approximation theo¬ 
rems considered in this paper will only be for smooth test functions. Bounds 
for non-smooth test functions are often more informative (see, for example, 
[15]), although an advantage of working with smooth test functions is that 
it is sometimes possible to obtain improved error bounds that may not hold 
for non-smooth test functions. Indeed, a standardised sum of independent 
random variables with first p moments equal to those of the standard normal 
distribution converges to this distribution at a rate of order n - ^ -1 " 2 for 
smooth test functions (see [8], [12] and [17]). In fact, it has also been shown 
by [7] and [11] that certain asymptotically chi-square and variance-gamma 
distributed statistics converge to their limiting distribution at a rate of order 
n~ l for smooth test functions even when the third moments are non-zero. 

In Section 3, these results concerning faster than order n -1 / 2 convergence 
rates are put into a general framework. Let Ayi,..., A ni; i,..., ■ ■ ■ > Xn d ,d 

be independent random variables with mean zero and unit variance, and set 
W = (Wi,..., Wfi) T , where Wj = ^== Xij- Then, for all continuous 
functions g, the statistic g( W) converges in distribution to g( Z) as n —>• oo, 
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where n = mini <j<d n j. In Section 3, we present general bounds for the 
distributional distance between these random variables with respect to a 
smooth test function metric (Theorems 3.2-3. 5). 

Our results can be summarised as follows. If the first p moments of the 
X{j are equal to those of the standard normal distribution, then our bound 
on the distance between g( W) and g{ Z) is of order n~ (p - 1 )/ 2 . This result 
generalises the results of [8], [12] and [17]. If p is an even integer and we also 
suppose that g is an even function ( g(—w ) = g{ w) for all w 6 M d ), then we 
can use symmetry considerations, as introduced by [7] and [11], to improve 
this convergence rate further to order n _P//2 . In particular, g( W) converges 
to g(Z) at rate re -1 even if the third moments of the X %3 are non-zero. 
This bounds generalises the bounds of [7] and [11] since their chi-square and 
variance-gamma statistics are of the form g{ W). 

These bounds hold for all functions g whose lower order partial derivatives 
have polynomial growth. The number of derivatives for which this condition 
must hold increases if we seek a faster rate of convergence. Also, this condi¬ 
tion can hold for fewer derivatives if d = 1. It would be possible to extend 
these results to the case that the derivatives of g have exponential growth, 
although for space reasons we omit this. We discuss this in Remark 3.6. We 
also restrict our attention to independent random variables Xij to simplify 
the exposition. However, the approach developed in this paper could cer¬ 
tainly be applied in situations in which there is dependence amongst the 
random variables; see [11] which uses an approach similar to ours, involving 
the multivariate normal Stein equation, to obtain approximation theorems 
for Pearson’s statistic. This is point is discussed further in Section 5. 

In Section 4, we apply the general bounds of Theorems 3.2-3.5 to obtain 
approximation theorems for some asymptotically chi-square and variance- 
gamma distributed statistics. We also consider an example involving the 
chi-distribution (x{d) = <?(Z) = (Z'f + • • • + Zj) 1 / 2 ). This is an interesting 
example because even though g is an even function, we cannot achieve a 
O^n^ 1 ) bound because g fails one of our differentiability conditions: its sec¬ 
ond order partial derivatives are undefined at the origin. Further applications 
to product normal approximation and the delta method are also considered. 

The rest of the paper is organised as follows. In Section 2, we obtain gen¬ 
eral bounds for the derivatives of the solution of the normal and multivariate 
normal Stein equations with test function h(g(-)). Explicit estimates for the 
solution are given in the case that the partial derivatives of g have poly¬ 
nomial and exponential growth. In Section, 3 we bound the distributional 
distance between the random variables g(W) and g(£ 1//2 Z), in a smooth 
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test function metric, for the case that the components of W are sums of 
independent random variables. Here, we assume that the lower order deriva¬ 
tives of g have polynomial growth. When g is an even function we are able 
to achieve faster convergence rates via symmetry considerations. In Section 
4, we consider some applications of the general results of Section 3. Finally, 
in Section 5, we summarise the results of this paper and consider directions 
for future research. 


2. Bounds for derivatives of the solutions of the normal and 
multivariate normal Stein equations 

2.1. Preliminary results 


We begin this section by obtaining a simple bound for the partial derivatives 
of the test function h(g( w)). Before deriving this bound, we state some 
preliminary results. The first is a multivariate generalisation of the Faa di 
Bruno formula for n-th order derivatives of composite functions, due to [21]: 


Q n 


n u dw i 


-h(g(w)) 


e fc(w) (»(w)) • n 

7rElI B^TT 


d\ B \g( w) 
nj-eB dw i ’ 


where n runs through the set n of all partitions of the set {1,... ,n}, the 
product is over all of the parts B of the partition n, and 151 is the cardinality 
of the set S. It is useful to note that the number of partitions of {1 ,... ,n} 
into k non-empty subsets is given by the Stirling number of the second kind 
{3 = nE‘=o(-l)*-'©/* (see [23]). 

We now introduce two classes of functions that will be used throughout 
this paper. We say that the function h : I C M —>• M belongs to the class 
C£(I) if exists and is absolutely continuous, with derivatives up to 

n-th order bounded. We say that the function g : —> K belongs to the 

class Cp(M rf ) if all n-th order partial derivatives of g exist and there exists 
a non-negative dominating function P : —>■ M such that, for all w € 

the partial derivatives satisfy 


d k 


n =1 ^ 


-9( w ; 


n/k 


< P(w) 


k = 1,..., n. 


If g G (7^(1^) then it is easy to see that, for all w G R d , 


n gl B 'g(w) 


< P(w). 
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This inequality allows us to obtain a compact bound for the partial deriva¬ 
tives of the test function h(g(-)), which in turn allows us to obtain relatively 
simple bounds for the solution (1.10) of the Stein equation (1.8). 

Lemma 2.1. Suppose h G C"(M) and g G Cp(M d ). Then, for all w € M d , 


gn 


n u dw ^ 


%(w)) 


< h n p( w), 


where h n = ELi {fc}ll^ (fc) ll- 
Proof. From the above it is clear that 


( 2 . 1 ) 


d r 


n?=i d Wij 


h(g( w)) 


<Eh* (W) i 

7rElI 


p( w) = E 


fc=i 


P(w), 


as required. □ 

We will also make use of the following lemma. 

Lemma 2.2. Suppose h € (7”(R) and g G C^(M), where a > 0 is a constant. 
Then the solution (1.10) of the Stein equation (1.8) is bounded by 

||u>/ (n) H|| < \\(hog) n ~ L \\ < ah n -\. 

Proof. The inequality \\wf( n \w)\\ < \\(h o g) n ~ L \\ is given in Lemma 2.5 of 
[8] and the second inequality follows from Lemma 2.1. □ 

2.2. General bounds for the solution 

Here, we obtain some general bounds for the solution of the multivariate 
normal Stein equation with test function h(g(-)). We begin with the following 
lemma, the proof of which is similar to that of Proposition 2.1 of [8]. 

Lemma 2.3. Suppose £ is non-negative definite and that h G CJf( M) and 
g G Cp(M rf ). Then, for all w G M rf , the solution (l.f) of the Stein equation 
(1.3) satisfies the bound 

poo 

<hn e~ ns EP(zfJ; z )ds, (2.2) 

Jo 


d n f{ w) 


z s,w = e s w + Vl - e~ 2s x, 


where 












R. E. Gaunt/Functions of multivariate normal random variables 


provided the integral exists. 

Suppose now that £ is positive definite and that h € and g € 

Cp -1 (M d ). Then, provided the integral exists, we have, for all w € 


d n f( w) 

IYj=i dwi, 


< h n - i min 
i<;<d 



^__ E|(£-V2z) ; P(z^ 2z )| da. (2.3) 


Proof. We begin by obtaining two expressions for the n-th order partial 
derivatives of the solution (1.4). Firstly, by dominated convergence, 


d n f( w) 

n;=i d Wlj 



gn 


mu dw ^ 


h{g(*l w z )) 


ds. 


(2.4) 


Now, provided that £ is positive definite, £ 1 / 2 exists and we may use 
integration by parts to obtain 


d n f( w) 

II-=i dwy 


f 


\/l — e 


2s 


=E 


(£” 1 / 2 Z) 


d 


in—1 


n^i 

i¥* 


Hg( z f w Z )) 


ds, 


and differentiating once more using dominated convergence gives 


d n+l f{ w) 

rpi 1 


L 


oo p —(n+l)s 


\/l — e 


2s 


=E 


(£ _ 1 / 2 Z)j 


5" 


n 


1 <j<n 


dwi. 


%( Z ^w‘ z )) 


ds. 


(2.5) 

The required bounds for the partial derivatives of the solution (1.4) now 
follow easily from (2.4) and (2.5) and Lemma 2.1. □ 


So far, we have imposed no restrictions on the dominating function P 
other than it is non-negative and that the integrals of Lemma 2.3 exist. We 
now introduce some conditions, which ensure that the integrals of Lemma 
2.3 exist and can be bounded relatively easily. 

We suppose that P can be written as P(x) = a + -Pi(x) + P 2 (x), where 
a is a non-negative constant and 

(i) . P\ and P 2 are non-negative, non-decreasing functions (for any x £ R d 

and any vector e € M. d with non-negative entries, /(x + e) > /(x)); 

(ii) . There exist non-negative constants /3d, 7 d and 5d such that, for any 

x lf ... ,x fc € R d , 


k d 

Pi(xiH-hx fe )</3 d ^Pi(x fc ) and P 2 (xH-fx fc )< 7 rf JJP 2 (< 5 d x fe ); 
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(iii). The expectations EPi(E 1//2 Z) and EP 2 (E 1//2 Z) exist; 

(hi)’. The expectations E|(Z)jPi(E 1 / 2 Z)| and E|(Z)jP 2 (E 1//2 Z)| exist for all 
i = 1,..., d. 

If P satisfies (i)-(iii), we write P € P; if P satisfies (i)-(iii)’, we write 
PGP*. 

Remark 2.1. Condition (i) is very mild, as we only need to write P in 
terms of Pi and P 2 if P is unbounded as |w| —>• 00 (otherwise we can take 
P(w) = a), in which case it seems reasonable to suppose Pi and P 2 are 
increasing functions. Condition (ii) allows the integrals of Lemma 2.3 to be 
bounded in terms of Pi and P 2 . This condition is not restrictive and allows 
many classes of functions to be considered: see the examples below. Lastly, 
(iii) and (iii)’ ensure that the integrals (2.2) and (2.3), respectively, exist. 

Example 2.2. Polynomial P. The function P(w) = A + B Y2i=i 
clearly satisfies conditions (i) and (iii)’. Condition (ii) can be verified by 
applying the crude inequality \x\ + • • • + x k \ b < k b (\x\\ b + • • • + \x k \ b ), b > 0. 
Writing P(w) = A + Pi(w), where Pi(w) = B Yli=i |^i| r S an d using this 
inequality we deduce that 

k 

Pi(wiH- fw t ) < fc’-yXwj), (2.6) 

l=i 


where r = maxi<j<rfrj. Hence, (ii) is satisfied with a = A, (3 k = k r , 7 ^ = 0 
and Pi(w) = BJ2i =1 \wi\ ri - 

Example 2.3. Exponential P. It is clear that P(w) = A exp (a |n>j| 6 ), 
where a, b > 0, satisfies (i). To verify (ii), we use the inequality \x\ + • • • + 
Xk | r < Ck, r (\x\\ r + • • • + \xk \ r ), where c k , r = max{l, k r ~ 1 }, which improves 
on the crude inequality used above. Let (w j) k = Wj k ■ Then 

P(wi H-b w fc ) 

/ d \ / d k 

= Aex p ( a ^2 \ w u H-1 ~ w ki \ b \ < Hexp ( a c k ,b ^ \wji\ b 

k j = i /v i =1 j = 1 

k / d \ fc 

= H^Jexp f ac ktb ^2 \ w ji\ b ) = A ^ k Yl p ( c k,b^j), 

1=1 ' i= 1 2 1=1 

and so (ii) holds with a = 0 , /3fc = 0 , 7 *, = *4 1_fc , = c k ,b and P 2 (w) = 

Aex.p(aJ2i = i In’ll 6 )- Clearly, (iii)’ holds if b < 2 or if b = 2 and a < 1/2. 
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Proposition 2.1. Suppose £ is non-negative definite and that h € C7 l (M) 
and g € Cp(M d ), where n > 1 and P € T. Then, for all w € M d , 


3"/(w) 

^ hn 

n u dw b- 

n 


+^2(EPi(S 1 / 2 Z)+P 1 (w))+ 72 P 2 (,52w)EP 2 ( ( 5 2 S 1 / 2 Z) 


(2.7) 


Suppose now that £ is positive definite and that h € Cf 1 (M) and g € 
Cp _ 1 (M rf ), where n> 2 and P € P*. Then, for all w € 


d n f( w) 

n;=i 


< h n - 


i min 

l<Z<d 


a 


E|(£~ 1 / 2 Z) z | +/3 2 (E|(£- 1 / 2 Z) z Pi(£ 1 / 2 Z)| 


+E|(£ 1 /2 z)z | Pl(w) ) +72E | (E -i/2 z)ij p 2((52S i/2 z) |p 2(52w) _ 

( 2 . 8 ) 


Proof. Since P G P, 

P /„SV 2 Z\ _ , p /„S 1 / 2 Z\ , p / S^Zn 

P( z s, w )— a + ^l( z s,w ) + ^2(Z SjW ) 

< a + /3 2 (Pi(e- s w) + Pi(Vl-e- 2 s £ 1 / 2 Z)) 

+ 72 P 2 (^e" s w)P 2 (, 5 2 v / l - e- 2 s S 1 / 2 Z) 

< a + /3 2 (Pi(w) + Pi(S 1 / 2 Z)) + 7 2 P 2 (< 5 2 w)P 2 ((i 2 X! 1 / 2 Z). (2.9) 


Substituting (2.9) into the integral inequality (2.2) gives 


d n f( w) 


n.';=i dwi 


< h n - 1 min 


i<Z<d Jo 


7^5 e K e - 1/2 z)-(« 


J- f p. 1 






whence on using / 0 °° d.s < /“ 7 =^ ds = 1, n > 2 , yields ( 2 . 8 ). 

Inequality (2.7) follows from a similar argument. □ 

Corollary 2.1. Fix d = 1, £ = 1 and let n> 2. Suppose h € C'^~ 1 (M) and 
g € C'p _1 (M) J where P € P*. Then, for all 


u;/ (n) (u;)| < h n —\ 


a + /feH(E|ZPi(Z)| + Pi(«;)) 


+ 72|w|P2(5 2 ra)EP 2 (d 2 Z) 


( 2 . 10 ) 
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Proof. Due to the decomposition P(w) = a + Pi(ic) + P 2 (w), we can write 
the solution as / = f gi + f g2 , where g\ € C'” _1 (M) and g 2 € Cp i "^ f L p 2 (M). 

By the triangle inequality, \f^ n \w)\ < \fgi\w)\ + \fgj\w)\, and bounding 
these two quantities using Lemma 2.2 and inequality (2.8) of Proposition 
2.1, respectively, and that E|Z| < 1, leads to the desired bound. □ 

In the univariate case, we can obtain a bound which involves two fewer 
derivatives of h and g than of the solution / (compare the following Propo¬ 
sition to inequality (1.7)); an improvement that is not possible in the mul¬ 
tivariate case (see [25]). 

Proposition 2.2. Fix d = 1, E = 1 and let n > 3. Suppose h € C'^' -2 (R) 
and g € C'p _2 (M), where P € P*. Then, for all w € M, 


|/ (n) MI < K - 2 


3a + PiH + P 2 {w) + &(EPi(Z) + |tc|E|ZPi(Z)| 


+ (1 + |u;|)PiH) + ^ 2 (EP 2 (5 2 Z)P 2 {5 2 w) 


+ E\ZP 2 (6 2 Z)\\wP 2 {S 2 w)\) 


( 2 . 11 ) 


Proof. The standard normal Stein equation is f"{w) — wf'(w) = h(g(w )) — 
E h(g(Z)). By a straightforward induction on n, 

f( n \ W ) = wf^-'Xw) + (n - 2)/ (n " 2) H + (h O ff) (n - 2) H, 


and applying the triangle inequality gives that, for every w € M, 

|/ (n) H| < I + {n-2)\f( n ~ 2 \w)\ + \(hog)^~V(w)\. 


Bounding these terms using (2.10), (2.7) and (2.1) yields (2.11). □ 

Now, we obtain estimates for the solution of the Stein equation 

V r EV^m(w) - w r VV’m(w) = , ( 2 - 12 ) 

n ; =i 9wi. 

where / is the solution (1.10) of the multivariate normal Stein equation 
(1.8). The Stein equation (2.12) plays an important role in Section 3. We 
proceed as before and the following lemma is analogous to Lemma 2.3. 

Lemma 2.4. Suppose T, is non-negative definite and that h € CU l+n (M.) and 
g € C'p +n (M rf ), m,n > 1. Then, for all w € M rf , the solution of the Stein 
equation (2.12) satisfies the bound 


d n i’ m ( W) 

n u dm, 


/ h’m+n 


coo re 

Jo Jo 


( m+n)s —nt 


EP(z; 


s 1 / 2 z,s 1 / 2 z' 

's,t, w 


ds d t, 
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where _ _ 

z s’t,'w = + e~ s y/l — e~ 2t y + \J 1 — e _2s x, 

provided the integral exists. Here Z' is an independent copy of Z. 

Suppose now that £ is positive definite and that h € C'^ n+n_2 (R) and 
g <E C™ +n ~ 2 {M. d ), where m, n > 1 and m + n > 3. Then, provided the 
integral exists, we have, for all w € M. d , 


d n lfm{ W) 

n- =1 ^ 


poo p oo (m+n)s n£ 

< Wn-2 1 gj^ d J Q J Q ^ _ e -2s ^/l _ e -2* 

x E|(£- 1 / 2 Z) fe (£- 1 / 2 Z / )iP(z2 ) w Z ’ Sl/2 z , )| d s d t. 


Proof. The solution of (2.12) can be written as 


^m(w) 



d m f 

U'jLl dw G 


KwMy) d y d *, 


where p is the probability density function of the random variable £ 1,/2 Z. 
By the dominated convergence theorem, 


a m / (z y )- - r f e - m ,nnA le - v + 

h L ‘- w v x| 


x p(x) dxds 

,- ms d m {hog) 


poo p 

Jo Jr c 


rr=i ^ 


( Z *£wM X ) dxds > 


and we can therefore write 


POO POO P 

^m(w) = / / 

Jo Jo J R 5 


d m (hog) 


nr=i ^ 


( z s ,fw)p( x )p(y) dx d y dt ds - 


By again applying the dominating convergence theorem, we have 

W) _ 

In In ^ T ij = 1 


n?=i 


I" r f c -(m + n)s c -nt d m+n (ho 9 ) x , y 

Jo Jo I*™ -Aw 


x p(x)p(y) dx dy dt ds 

/*oo POO 

/ / e -(m+n) Se -nt E 

J 0 Vo 


^ m +"(feog), S 1 / 2 Z,S 1 / 2 Z' 


Lnr=r^b 


a s,t,w 


dt ds, 
(2.13) 
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which, on applying integration by parts twice, can be rewritten as 


d n lPm( W) 

n u dw *i 


poo p 

J 0 Jo 


y/l — e 2s y /1 — e 


=E 


2 1 


d m+n 2 {hog) £ 1 / 2 z,E 1 / 2 Z'\ 

x rii<i< m +»-2^i/ Zs,t ’ w 

&k,l 


dtds. (2-14) 


The desired bounds now follow from (2.13) and (2.14) and Lemma 2.1. □ 

Proposition 2.3. Suppose Z is non-negative definite and that h G C™ +n ( R) 
and g G Cp +n (U. d ), where m,n> 1 and P G F. Then, for all w G M d , 


d n f>m{ W) 

z ^m+n 

n u dw { , 

n(m + n) 


o + /3 3 (2EPi(S 1/2 Z) + Pi 

( W )) 


+ 73(EP 2 («5 3 S 1 / 2 Z)) 2 P 2 ( ( 53w) 


(2.15) 


Suppose now that Z is positive definite and that h € C™ +n ~ 2 (M) and g G 
Cp +n — 2 (M d ), where m,n > 1 with m + n > 3 and P G J 7 *. Then, for all 
w G 


w) 


n u dw t 


< 2 h m+n -2 min 
KJfc.Kd 


aE|(E- 1 / 2 Z) fc |E|(S- 1 / 2 Z) z | 


+ /3 3 (E|(E- 1 / 2 Z) fc Pi(E 1 / 2 Z)|E|(E- 1 / 2 Z) z | +E|(E-i/ 2 Z) fc | 

xE|(E- 1 / 2 Z) i Pi(E 1 / 2 Z)| + E|(E- 1 / 2 Z) A; |E|(E- 1 / 2 Z) z |P 1 (w)) 

+ 73 J P2(<53w)E|(E- 1 / 2 Z) fc P 2 (<5 3 S 1 / 2 Z)E|(E- 1 / 2 Z) z P 2 ( ( 5 3 S 1 / 2 Z)|]. (2.16) 

Proof. By using a similar argument to the one used to prove inequality (2.9) 
we obtain 


P(zg;: z ’ sl/2z ') < a + /3 3 (Pi(w) + P 1 (S 1 / 2 Z) + P 1 (S 1 / 2 Z / )) 
+ 73W 3 w)P 2 (<5 3 E 1 / 2 Z)P 2 ( ( 5 3 £ 1 / 2 Z , ). 


(2.17) 


We then proceed as we did in the proof of Proposition 2.1 by substituting 
(2.17) into the integral inequalities of Lemma 2.4 and then bounding the 
resulting integrals. Here, in obtaining (2.16) we used the inequality 


oo poo — (m+n)s 


poo p 

Jo Jo 


nt 


y/l — e 2s y/l — e 


2 1 


■ dsdt < 


f 

Jo 


oo g—3s 


y/l — e 


2s 


: ds 


Jo 




%/l — e 


2t 


: dt 


7T 7T 

= 4 ■ 2 < 2 ’ 


which holds since n > 1 and m + n > 3. 


□ 
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Again, in the univariate case it is possible to obtain a bound for the partial 
derivatives of that involve fewer derivatives of h and g. 

Proposition 2.4. Fix d = 1 and let E = 1. Let m > 1 with m + n> 3 and 
suppose that h G C'™” 1 (M) and g G Cff~ 1 (W i ), where P G P*. Then, /or all 
wGl, 


IV^ } H| < h m —i 


2a(3 + re 2 ) + Pi (to) + P 2 H 


+ /3 2 ((1 + 2|w;|)Pi(w;) + EPi(Z) + 2|u;|E|ZP 1 (Z)|) 
+ 2/5 3 (l + «> 2 )(PiM + 2E|ZPi (Z)|) 

+ 72(P 2 (<52^)EP 2 (<5 2 Z) + 2|u;P 2 ( ( 5 2 w;)|E|ZP 2 ( ( 5 2 Z)|) 


+ 2 73 (1 + w 2 )P 2 (6 3 w)[E\ZP 2 (5 3 Z)\} 2 


Proof. The solution ip m satisfies the Stein equation 'ip!f n (w) — un'p' m {w) = 
and therefore 

I^HI = | f {m+1) (w) - vnlC(w) - i// m (w )| 

= |/ (m+1) M - wf^ m) (w) + w 2 ft'{w) - /H| 

< |/ (m+1) HI + K (m) HI + (i + ^ 2 )I^HI- 

Bounding the final three terms using (2.11), (2.10) and (2.16), and simpli¬ 
fying the resulting bound by using that E|Z| < 1 completes the proof. □ 


2.3. Bounds for polynomial and exponential P 

In Section 2.2, we gave bounds for the derivatives of / and in terms 
of a dominating function P from a general class of functions J or J 7 ,. As 
was noted in Examples 2.2 and 2.3, the functions P(w) = A + B y/_, H| r * 
and P(w) = A exp (a \ w i\ b ) are contained in these classes. Therefore 
we can obtain bounds for the derivatives of / and for the case that the 
derivatives of g have polynomial or exponential growth as special cases of the 
bounds of Section 2.2. The bounds for the case of polynomial P will be used 
in the proofs of Theorems 3.2-3.5. The bounds for the case of exponential 
P will not be further used in this paper, but may prove useful in other 
applications; for a further discussion see Remark 3.6. 

Corollary 2.2. Let P(w) = A + B Yli=i H| r S where ri > 0. Suppose E 
is non-negative definite and h G Cjf( R) and g G Cp(R d ) for n > 1. Let 
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Zi = (Z 1/2 Z)i ~ N( 0, al). Then, for all w € R d , 


d n f{ w) 

^ hn 

n-=i 

n 


d 

A + B^22 r '(\ Wi 

i =1 


Vi 


+nz,p) 


Suppose now that E is positive definite and h £ Cf 1 (M) and g £ Cp 1 (R d ) 
for n> 2. Then, for all w £ R rf , 


d n f( w) 

n}Lx dwi, 


< h n - i min 
l<Kd 


d 

AE\(S-^ 2 Z)i\ + B^2 ri (\w i \ ri E\(T- 1 /‘ 2 Z)i 

i =1 


+ E|(E- 1 / 2 Z) z Zf|) 


Suppose now that E = i^. TTjen we /iare t/ie simplified bound 

d 

A+ £^2^(1^ +E|Z| ri+1 ) . 

2=1 

Consider now the case d = 1 with E = 1. Suppose h £ C^ _2 (R) and g £ 
Cp~ 2 (R), w/iere n > 3 and P(w) = A + B\w\ r . Then, for all w £ R, 


9 n /(w) 


n ?=i ^ 


^ h n .— 1 


|/ (n) HI < h n —2 \3A + 2 r B(\w\ r+1 + 2\w\ r + \w\E\Z\ r+1 + E|Z| r )) 


Proof. The bounds follow from applying inequalities (2.7), (2.8) and (2.11) 
with P( w) = A+B Yli =l From Example 2.2, we have a = A, fik = k r * > 

where r* = maxi<j<rfrj, 7 fc = 0 and Pi(w) = -B Xa=i Although, by 

examining the derivations of inequalities (2.7), (2.8) and (2.11), we see that 
we can slightly improve on these bounds by using the inequality Pi(wi + 
• • • + Wfc) < Y^!j=l k Ti P\ ( w j), instead of inequality (2.6). Finally, we simplify 
the hnal two bounds by using that E|Z| < 1. □ 

Corollary 2.3. Let P( w) = A + B Yli=i I w i\ ri > where rt > 0. Suppose E is 
non-negative definite and h £ Cfi l+n ( R) and g £ Cp +n ( R d ) /or m,n > 1. 
Then, /or all w £ R d , 


d n i> m (yv) 


n -=i 

n(m + n) 


d 

A + bY l 3 r< (K| r< + 2E\Zi\ ri ) 
2=1 
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Suppose now that £ is positive definite and h £ C™ +n 2 (M) and g £ Cp +n 2 
for m, n > 1 and m + n > 3. Then, for all w £ M 0 *, 

^m(w) 


n u 


< h m+n - 2 min 


l<fc,Z<d 


j4E|(£ _1 / 2 Z) fc |E|(£ _1 / 2 Z)i| 

+ S^3 ri (|n;ir i E|(£- 1/2 Z) / |E|(£' 1/2 Z) fc | 
2=1 

+ 2E|(£- 1 / 2 Z) fe |E|(£- 1 / 2 Z)^|) 


Suppose now that £ = I f j- Then we have the simplified bound 

d 


d n if m (w) 


nu dw. 


£ hm+n —2 


A + -B^3 r< (H r< + 2E|Z| ri+1 ) 


2=1 


(2.18) 


Consider now the case d = 1 with £ = 1. Suppose h £ C? 71 1 (M) and 


g £ Cp 1 (M), w/iere m > 2 and P(u>) = £L + .B|u;| r . TTiert, for all w £ M, 
|V4? } MI < /im-1 [a( 6 + to 2 ) + 2 • 3 r ' J B(|^| r '+ 2 + 2M r+1 + 2| W | r 
+ 2E|Z| r+1 (l + M +w 2 ) + E|Z| r ) . 

Proof. The proof is analogous to that of Corollary 2.4. □ 


We now state bounds for the case that the dominating function P grows 
exponentially. The proofs are similar to those for the polynomial growth 
case and are omitted. 

Corollary 2.4. Let P( w) = Aexp(a'f2i = i\u’i\ b ), where a,b > 0. Suppose 
£ is non-negative definite and h € (7”(R) and g € Cj^R^) for n > 1. Let 
Zi = (£ 1 / 2 Z)j ~ JV(0, <7 2 j). Then, for all w £ M. d , 


d n f{ w) 


n u 9w l 


Ahr. 


<-- exp ( a'Y^C 2 ,b i \w i \ hl jEexp ( a'^ j c 2 ,b i |(£ 1/2 Z)i| 6i ). 

n V i=i J V i= l ' 

Suppose now that £ is positive definite and h £ R) and g £ C'p“ 1 (M <i ) 

/or n >2. Then, for all w £ R rf , 


9 n /(w) 


n u 


< Ah n - 1 exp ( a £ 02,6iki 
^ i=l 


x min E 
Kl<d 


(£ 1 / 2 Z),exp | a^^c 2 ,6 i |(£ 1,/2 Z)i| 


2=1 
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Suppose now that £ = Then we have the simplified bound 


d n f( w) 


n u dm 


< Ah n -\ exp (a 


'^2c 2 , bi \wi\ 

i=l 


min E 

Kl<d 


Zi exp (a £ c 2,bi\Zi 


1=1 


Consider now the case d = 1 with £ = 1. Suppose h G Cf 2 (M) and g G 
Cp~ 2 (M), where n > 3 and P{w) = A + B\w\ r . Then, for all w€K, 


|/ (n) 0)| < Ahn -2 exp(ac 2 ,b\w\ b ) 1 + Eexp(ac 2 , 6 |E'| b ) 


+ |u;|E|Z exp(ac 2 , 6 |E'| b )| 


Corollary 2.5. Let P( w) = j4exp(aXf=i \wi\ b ), where a,b> 0. Suppose £ 
is non-negative definite and h € C'™ +n (M) and 5 € C'p" l " n (M d ) for m,n > 1. 
Then, /or all w € 


d n lfm (w) 


n-=i 


< 




m+n 


n(m + n) 


2—1 


exp | a ^ C 3Al^| 6 l )U(a^ C 3 A |(£ 1 / 2 Z) 




2—1 


Suppose now that £ is positive definite and h G C™ +n 2 (M) and g € Cff +n 2 (M d ) 
/or m, n > 1 and m + n > 3. T/ien /or all w € 


d n V> m (w) 


n - =1 


< A/i m+n _2 exp ( a ^2 c 3,bi \ w i\ 
' *=i 


X min < E 
l<Z<rf 


(£- 1 /2 Z ) / exp(a^ C3A |(£ 1 /2 Z ) j 


2—1 


Suppose now that £ = Id- Then we have the simplified bound 

f d 

< Ah m+n _ 2 exp ( a ^ c 3j6i | Wi \ 


d n fim( W) 


n -=i ^ 


2—1 


X min < E 
l<Kd 


Zi exp ( o^c 3 ) 6 i |Zi| 6i 


2 — 1 


Consider now the case d = 1 with £ = 1. Suppose h G C™ 1 (M) and 
g G C'p^ 1 (M), u>/iere m>2 and P(w) = A + B\w\ r . Then, for all aiGl, 


IV^MI < Ah m _\ exp(ac 3 j b|io| 6 ) [l + E exp(ac 2 ) fe|^| 6 ) 


+ 2 |n;|E|Zexp(ac 2 , 6 |Z| fe )| + 2(l + n; 2 ){E|Zexp(ac 3 ife |Z | 6 )|} 2 
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3. Bounds for the distributional distance between g( W) and g( Z) 

With the bounds for the derivatives of the solution of the multivariate normal 
Stein equation with test function h(g(-)) stated in Section 2, we are in a 
position to obtain bounds for the distributional distance between g{ W) and 
its limiting distribution ^(E 1 / 2 Z). Such bounds can be achieved by bounding 
the expectation E[V T EV/(W) — W T V/(W)] by using various coupling 
techniques that have been developed for multivariate normal approximation 
(see [13], [14], [26] and [22]), where the coupling is chosen based on the 
dependence structure of W. 

For the rest of this paper, we shall consider the case that W = (W \,..., Wd), 
where Wj = -^= Yl7=i Xij an d the X{j are mutually independent (as a re¬ 
sult, in this section, we shall mostly be taking E = Id). From here on, W will 
denote such a random vector. The restriction to this class of statistics allows 
for a detailed investigation of convergence rates, and we would expect that 
the factors effecting convergence rates here (matching moments, whether g 
is an even function, and the differentiability and growth rate of g) to also 
to apply in more general settings. 

3.1. Preliminary lemmas 

We begin by obtaining bounds for the distributional distance between g( W) 
and g(Zi) in terms of the derivatives of the solution of the MVN(0, E) Stein 
equation with test function h(g(-)). We give two bounds: one for general g 
and another for when g is an even function. In Section 3.2, we apply these 
bounds and those of Section 2.3 to bound the distance for the case that the 
derivatives of g have polynomial growth. 

Unless otherwise stated, in this section, / will denote the solution (1.10). 
We shall also let C k (M. d ) denote the case of real-valued functions defined on 
W l whose partial derivatives of order k all exist. 

Lemma 3.1. Let ..., X nij i ,..., • • •, ^n d ,d be independent ran¬ 
dom variables with = E Z k for all 1 < i < nj, 1 < j < d and all 

positive integers k < p. Let E = Id and suppose h and g are such that 
f € C^ +1 (M rf ). Then, if the expectations on the right-hand side of (3.1) 
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exist, 


|E%(W))-E%(Z))| 

d rij 

j = 1 »=1 vr 




A --' lP +i' W i,0' 


u dw? 


+ - sup E 
P e 


xv +ldP+lf (w^) 

Xij dw p+1 


(3-1) 


where W= W(^ + -|=Xjj /or some 0 € (0,1), and the random, vector 
Xjj has as its j-th entry and the other d — 1 entries are given by zero. 
Proof. We aim to bound E/i(g(W)) — E/i(g(Z)), and do so by bounding the 
quantity £? =1 e[0(W) - Wj£-(W)]. We let W^) = W - -^X, y , 

Then, Taylor expanding J^(W) and J^-(W) about W^O) gives 
d r q2 j- 

X> 


J=1 


<9^ f d f 

J (W)-Wj-^-(W) 


dw j 


dwi 


d nj i f> 2 f 1 

J=12—1 J J j= 12—1 

d rij p -2 

E E E , . fc/2+l "“'3 0„.,fc+2 

j=l t=l fc=0 dw j 

d n j p— 1 

“ E E E ZTTfc/2+1/2 Q,„fc+1 

j=0 i =1 fc=0 


n,, EX «J|< w) 


EX^E_4(W^ ) ) 


Exj! +1 9 t (W^ } ) + + i?2, 


(3.2) 


where 




1 


^ “ EE, lVn (p+1)/2 o 

7=1 i=i (P - l) !n ,- 0 


supE 


p+ il vv j,eJ 


J 9m 


d n 3 - 


Iflal < EE- 


i 


■ sup E 


P+1 d p+1 f (i) 

d^ pTT( j ’ d) 


^ ril rj ( P+ 1 )/ 2 0 

j=i ?;=i P !ra j y 

Using independence and that the Xjj have mean zero and collecting terms, 
we can write the right-hand side of (3.2) as 

d n j p —1 i ofc-fl , 

E E E 77T73U7J - EI f 1 ] E ^( wM ) + «! + «*• 

j=l i=l fc=l ^- n j 
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Now, by the matching moments assumption, kKX k - 1 — EX k J~ x = k~EZ k 1 — 
E Z k+1 for all 1 < k < p — 1. But the moments of the standard normal 
distribution satisfy kEZ k ~ 1 — E Z k+1 = 0 for all k > 0. Thus, |E/i(g(W)) — 
Eh(g(Z))\ < |i?i| + |i? 2 |) and the proof is complete. □ 

Remark 3.1. In the statement of Lemma 3.1, we did not give precise con¬ 
ditions on h and g such that / G Cf >+1 (M d ), nor restrictions on the Xij 
such that the expectations on the right-hand side of (3.1) exist. In applying, 
Lemma 3.1 in practice (see Section 3.2), one would need to check that h, g 
and the X t] are such that these conditions are met. These comments apply 
equally to Lemmas 3.2-3.5. 

We now turn our attention to the case that g is an even function. The 
following key lemma enables us to obtain faster convergence rates in this 
case. 


Lemma 3.2. Let E be non-negative definite. Suppose that g : —>• R is an 

even function (g( w) = g{— w) for all w G M. d ). Then, the solution (1.10), 
denoted by f, is an even function. Moreover, for odd k > 1, provided that 

d k f( w) 


itu 9wi. 


exists, 


E 


5 fc /(sV 2 Z) 

if the expectation in (3.3) is well-defined. 


= 0 , 


(3.3) 


Proof. As TJ/ 2 Z = -E x / 2 Z and g is an even function, we have, for any 
w G R d , 


/(-w) = - / E[%(-e" s w + y/l - e -2s E 1/2 Z)) - Eh(g(Z))] ds 

Jo 

[h{g(-e ~ s w - Vl -e- 2s S 1/2 Z)) - Eh(g(Z))] ds 

[h(g(e ~ s w + y/l - e _2s E 1 / 2 Z)) - Eh(g(Z))\ ds = /(w), 

and therefore the solution (1.4) is an even function. 

Since / is an even function, the partial derivatives of odd order are odd 
functions, provided they exist. Therefore, since E 1 / 2 Z = —E 1//2 Z, it follows 
that (3.3) holds. □ 

It is interesting to note that there exists a partial converse to Lemma 3.2. 
In proving this result, we shall need the following lemma. 
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Lemma 3.3. Here, let f denote the solution (1-4) of the MVN(0,E) Stein 
equation with test function h. Suppose E is non-negative definite. Then, for 
h G C k {R d ), k > 1, 


’c? fe /(S 1 / 2 Z)’ 

- X E 

'd k h(T}/ 2 zy 


k 

uU dw ‘. J 


(3.4) 


provided the expectations exist. 

Suppose now that E positive definite. Then, for h G C' fc_1 (M d ), k > 2, 


E 


'd fc /(E 1/2 Z)' 

.TUT/. 


1 

k 


E 


(S” 1/2 Z) 


^“^(E^Z)' 

UiSKk d w ii J ’ 

3/1 


for any 1 <1 < d, provided the expectations exist. 

Now, set d = 1 and E = 1. Then, for h G C k ~ 2 (M. d ), k > 3, 


E f {k \Z) = —J-E[(l + z 2 )h {k - 2 \z)\, 

Kj 


(3.5) 


(3.6) 


provided the expectations exist. 

Proof. Recall that the solution of the MVN(0, E) Stein equation with test 
function h is given by 


roc _ 

/(w) = - / [E/i(e -s w + \J\ — e _2s S 1 / 2 Z / ) - E/i(E 1 / 2 Z)] d s, 

Jo 

where S 1//2 Z' is an independent copy of E 1,/2 Z. Differentiating / using the 
dominated convergence theorem and then evaluating at S 1//2 Z and taking 
expectations gives 


E 


'd k f(T 1 / 2 zy 
.rRi h - 




(e" s E 1/2 Z+v / l-e- 2s E 1/2 Z') 


ds. 


But e _s S 1//2 Z + \/l — e _2s E 1//2 Z / = S 1 / 2 Z, and so on evaluating the integral 
Jp 00 e _fcs ds = \ we obtain (3.4). For /i G C fc (lR d ), we immediately obtain 
(3.5) by applying integration by parts to (3.4), and (3.6) follows from another 
application of integration by parts. It is not hard to see that, by arguing more 
carefully, the differentiability assumptions can be weakened to h G C fc ~ 1 (]R (i ) 
and h G C k ~ 2 { R), respectively, but we omit the details. □ 

We can now establish a partial converse to Lemma 3.2. 
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Lemma 3.4. Let f denote the solution (1.10). Suppose E is non-negative 
definite and let k > 1 be odd. Suppose that h E C k ( R), and g E C k ( R rf ) 
is such that both expectations in equation (3./) (with h replaced by h(g(-))) 
exist for any h E C k ( R). Then if 


E 


’<9 fc /(S 1/2 Z)’ 


= 0 , 


for all h E 


(3.7) 


it follows that g( w) = g(—w) /or almost all w E M d . 

./Vow, let k > 3 6e odd. If SI is positive-definite, the same conclusion holds 
if we replace the conditions involving h and g by h E C^~ 1 (M), and g E 
C k ~\R d ) is such that the expectations in (3.5) (with h replaced by h(g(-))) 
exist for any h E In the univariate case (d = 1), we can weaken 

the assumptions further to h E and g E C k ~ 2 (M. d ) is such that the 

expectations in (3.6) (with h replaced by h{g{-))) exist for any h E C k ~ 2 (M). 

Proof. We prove the first assertion; the other two assertions can be proved 
similarly. Suppose that the expectation in (3.7) is equal to 0 for all h E 
C k (R). Then, by the assumptions of the lemma and equation (3.4), 


0 = E 


= E 


d k 




d k 


lULidwi, 


%(£ 1/2 Z)) 

%(-£ 1/2 z)) 


(3.8) 

(3.9) 


for all h E C k (M). Here, we used that E 1 / 2 Z = —E 1//2 Z. Combining (3.8) 
and (3.9), we have that, for all h E C k (U), 


[ [ Qk 

Jm d l rij=i dxij 


h(g(.*-)) 



Hg(-x)) 


|p(x) dx = 0, 


where p is the MVN(0, E) p.d.f.. But as the above integral is equal to 0 for 
all h E C k (M), it follows that g(x.) = g{— x) for almost all x E R d . This is 
because if g(x) and g(— x) were to differ on an interval with positive Lebseque 
measure, it would be possible to chose a function h from the class C k (R) 
such that the integral would be non-zero, resulting in a contradiction. □ 

With the aid of Lemma 3.2, we are able to obtain an analogue of Lemma 
3.1 for the case that g is an even function. The symmetry of the function 
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g allows us to obtain faster convergence rates, for smooth test functions h. 
The following partial differential equation shall appear in our proof: 


hi v dw l 


(w) - Wk 


dw k 



d p+1 f 

dw*j +1 


(w). 


(3.10) 


Bounds for the solution ipj and its partial derivatives were given in Sections 
2.2 and 2.3. 

Lemma 3.5. Let Xyi,..., X nii i,..., Xi jC + • • •, X ndt d be independent ran¬ 
dom variables with EX*- = E Z k for all 1 < i < rij, 1 < j < d and all 
positive integers k < p. Suppose g : —> R is an even function. Suppose 

further that the solution (1.10), denoted by f, belongs to the class C' p+2 (M' i ) 
and that the solution tpj to (3.10) is in the class C 3 (M d ). Then, if the expec¬ 
tations on the right-hand side of (3.11) exist, 


d rij 


1 


|E%(W))-E%(Z))| < EE , s/2+1 

j=i i i O -" ;, 


supE 
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P + 1 e 

d rij 
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+ |EXf +1 | supE 
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x ij dP+ L (w f e ) 

°dv^ +2y J ’ d> 


*3 


+ YY^— 

^ Z—j , (p+l)/2 
j =1 i =1 pmj i_i 7—i n 


a iik -i ( 

ee 4 ?{ 

k= i i=i n k 1 


sup E 
e 


X, 


Ik 




dwl 


(W, 


fc,0' 


+ - sup E 
^ 0 


Xi 


s 

lk d w k 




(3.11) 


where is defined in Lemma 3.1. 


Proof. By a similar argument to the one used in the proof of Lemma 3.1, 
we have 


E e 

i=i 


l}2f (w)-w,fL {W) 


dw] 


d rij 

“ , p/2+1/2 

i=i i=i Pin/ 


dwj 


[pEXfO 1 - EX^ + 1 ]e|^(W^)) + + i? 2 , 
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where 


d rij 


i*ii £ EE 

i-\ i.i pm, 

d rij 

i^i < EE 
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Ir) p/ 2+ i s 7 e 


x p d p+ -f (i) 


3,0 > 


(p + l)\n p / 2+1 o 


supE 


x»«EX (w «>- 


By the matching moments assumption, EX?- = E Z p 1 = 0. Using this 
fact and Taylor expanding d Q + v + 1 (W^’ J )) about W gives 


E e 

3 =1 


82/ (W)-^-|h(W) 


dw'j 


dvjj 


d rij 

-EE- 
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, P +i„8’’ +1 / 




(w (i ^) 


where 
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* - -EE 


i (P+ 1 )/ 2 

j = l j=l P' n j 
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= X + f?i + r 2 + r 3 


EX? +1 E ° P+ „ (W), 




9^ +1 


d rij 


l^sl < EE 


|EX|; +1 |siipE 


| p/ 2 +! 
j=X i =1 P'^j 


*7 


x„.^iC(w« 


^ +2 


3,0 > 


To achieve the desired 0(n 1 p ^ 2 + • • • + n d P ^ 2 ) bound we need to show that 
E (W) is of order nh 1 ^ 2 H-hnT 1 ^ 2 , since in general EX? +1 / 0. We 

oWj i a lj 

consider the MYN(0,/^) Stein equation with test function dP+ p+ { : 


I ^3 ( ^ d ^3 ( \\ dP+1 f , ^ E7 

M9? w -"W W ) = st w ^ e 


k =1 


® W j 

d P+lf 


dw p 


p +1 


(Z) 


By Lemma 3.2, we have that E |—^4 (Z) = 0, and therefore 


E 


d p+1 / 


(W) 


d 2 ip 


E e 

fc=i 


' J (W)-W fc |^(W) 

dw k 


dw 2 k 


(3.12) 


We can use Lemma 3.1 to bound the right-hand side of (3.12), which allows 
us to bound N. All terms have now been bounded to the desired order and 
the proof is complete. □ 
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3.2. Approximations theorems for polynomial P 


Lemmas 3.1 and 3.5 allow one to bound the distributional distance between 
g(W) and g(Z) if bounds are available for the expectations on the right- 
hand side of (3.1) and (3.11), respectively. In this section, we obtain such 
bounds for the case that the derivatives of g have polynomial growth. For 
space reasons, we do not give bounds for the case of g with derivatives of 
exponential growth; see Remark 3.6 for a further discussion. We begin by 
proving the following lemma. 

Lemma 3.6. Let P(w) = A + B Yli=i I w i\ ri > where A, B and r ±,..., rd are 
non-negative constants. Suppose T, = Id and let q > 0. Then 


E 


E 


yl f fw(L 

X ^dw p A W h e > 


X1 d ’ j t ( W w- 
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(3.13) 
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where the inequalities are for g in the classes Cp(M d ), (7p 
respectively. Suppose now that d = 1 and £ = 1. Then 
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and Cp +1 


E\Xff^(W^)\ <h p - 2 3AE\Xi\ q + 2 r B(2 r+1 E\Xi\ q (E\W\ r+1 + E\W\ r ) 
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where the inequalities are for g in the classes C v p 2 (M) and Cp~ 1 (M) ; re¬ 
spectively. 

Proof. Let us prove the first inequality. From inequality (2.18) we have 


E 


8 n f r\ 
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n 


a 

AE\Xij | q + B^2 2 r * (E|X?. (W? 


')yt| 

e> 


k =1 


+ E\Z\ rk E\X, 


ij\ 


By using the crude inequality |a + b\ s < 2 s (|a| s + \b\ s ), which holds for any 

(i) 

s > 0, and independence of Xjj and W- , we have 
E| X? j (W$) rk \ < 2 rfc E 

< 2 rk (E|Xy|«E|wfr* + -^E| X i:j \ rk+qS J . (3.14) 

Using that E\W^\ rk < E|Wj| rfe leads to the desired inequality. This can be 
seen by using Jensen’s inequality: 

E| Wj\ r * = E[E[|Wj + nJ 1/2 Xij \ rk \ wf]\ 

> E\E[Wj + nJ 1/2 Xij | Wij]\ rk = E\wf\ rk . 

The proofs of the other inequalities are similar, with the only difference 
being that we also make use of the inequality 

E|X?.(W r $) r * +I | < 2 rk+l ~ 1 (E\Xij\ q E\W^\ rk+l + -^E\X ij \ rk+l+< ^ , 

for l = 1,2, which is obtained via an analogous calculation to the one used 
to obtain (3.14), but here we used the inequality |a + 6| s < 2 S—1 (|a| s + |6| s ), 
which holds for any s > 1. □ 

By applying the inequalities of Lemma 3.6 to the bounds of Lemmas 3.1 
and 3.5, we can obtain the following four theorems for the distributional 
distance between g( W) and g{ Z) when the derivatives of g have polynomial 
growth. Theorem 3.2 follows from using inequality (3.13) in the bound of 
Lemma 3.1, and the other theorems are proved similarly. 
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Theorem 3.2. Let X \\,..., X n i ,..., X\ ^,..., X n ( j be independent ran¬ 
dom variables with EX^- = E Z k for all 1 < i < nj, 1 < j < d and all 
positive integers k < p. Suppose also that E|Xjj| n+p+1 < oo for all i, j and 
l. Let P( w) = A + B Y2i =i |^j| r S where A, B and r\,..., r^ are non-negative 
constants. Suppose g € Cp(M d ). Then, for h € C P (R) ; 


|E%(W))-E%(Z))| 

, d nj d 

< TFTIvi + B E 2r ‘ ( 2’-‘E|X iJ f+ 1 E|W J f‘ 

j 


7=1 i=l ^ 


fc=l 


+ 


2 r fe 

rfc/2 


E|X^P +P+1 +E|Zr fc+1 E|X ij | p+1 


n 


Theorem 3.3. Let Xi,..., X n 6e independent random variables with EX^ = 
EZ fc /or all 1 < i < n and all positive integers k < p. Suppose also that 
E|Xj| r+p+2 < oo for all 1 < i < n. Let P(w ) = A + B\w\ r , where A, B and r 
are non-negative constants. Suppose g G Cp -1 (M). Then, for h G C p_ 

\Eh(g(W))-Eh(g(Z))\ 

< P +1 


^p +1)/ 2 h P -£ 3TE|Xj| p+1 + 2 r 5( 2 r+1 E|Xj| p+1 (E|IT| r+1 +E|VK| r ) 

i=i L \ 


nr+2 \ 

+ 4E|Z| r+1 E|Xj| p+2 + —-E|Xj| r+p+2 

n r / 2 / 


Theorem 3.4. Let Xi i,..., X n i,..., Xi^,..., X n ^ 6e independent ran¬ 
dom variables with EX- = EZ^' /or all 1 < i < nj, 1 < j < d and all 
positive integers k < p. Suppose also that E| A/j| ri+p + 2 < oo /or all i, j 
and l. Let P( w) = A + B Yli=i I w i\ ri > where A, B and r\,... ,rd are non¬ 
negative constants. Suppose g € Cp +2 (M d ) is an even function. Then, for 
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h G C P+ 2 (R), 
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Theorem 3.5. Let X 1 ,..., X n &e independent random variables with EX^ = 
EZ fc /or all 1 < i < n and all positive integers k < p. Suppose also that 
E|Xj| r+p+4 < 00 for all 1 < i < n. Let P(w ) = + where A, B and r 

are non-negative constants. Suppose g G Cp(R) is an even function. Then, 
for h G C p { R), 

\m(g(W)) -Eh(g(Z))\ < (^±1 + |EY?« 
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“t VP + 1 

3AE|Xi | p+2 + 2 r B ( 2 r+ 1 E|Xj| p+ 2 (E| VF | r+1 + E|VF| r ) 


ir +2 
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+ 16E|Z| r+1 E|X / | p+4 + —T-E|X ;| r+p+4 

n r B ) 
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Remark 3.6. From Theorem 3.2, we see that if the first p moments of the 
Xjj match those of the N( 0,1) distribution then, provided E|Xj / ri+p+1 < oo 
for all i,j, l and g G C^(R d ) for Pi(w) = A + BY^i=\ \wi\ ri , the rate of con¬ 
vergence of g( W) to <?(Z) is 0{n~( p ~ 1 ^ 2 ), where n = mini <j<d n j. For this 
rate of convergence, we require that the test function h is in the class C P (R). 
By Theorem 3.4, it follows that, for even p, the rate of convergence can be 






















R. E. Gaunt/Functions of multivariate normal random variables 


29 


improved to 0(n p / 2 ) if we strengthen the assumptions to E|Xjj | ri+p + 2 < oo 
for all i,j,l, that is an even function, and that h € When 

d = 1, we see from Theorems 3.3 and 3.5 that we can weaken achieve these 
convergence rates with weaker assumptions on g and h, possibly at the ex¬ 
pense of stronger conditions on the existence of absolute moments of the 
Xi. 

One could derive analogues of Theorems 3.2-3.5 for the case that the 
dominating function of g is of the form = Aexp(a ^)^ =1 \ w i\ b )i where 

0 < b < 1 / 2 . Such results could be derived by using the bounds of Sec¬ 
tion 2.3 to obtain an analogue of Lemma 3.6 and substitute the result¬ 
ing bounds into the general bounds of Lemmas 3.1 and 3.5. We do not 
present these results for space reasons, but note here that it can easily be 
seen that similar principles regarding the rate of convergence of g( W) to 
g{ Z) would apply. For example, the analogue of Theorem 3.2 would give 
a bound on |E/i(g(W)) — E/i(g(Z))| that was 0(n~^ p ~ 1 ^ 2 ) under the fol¬ 
lowing assumptions. The first p moments of the Wj would also agree with 
those of the X(0,1) distribution, although the absolute moment assump¬ 
tion would be replaced by the condition that an expectation of the form 
exp(/3|Wj| 7 )] would exist, for some a, /3 ,7 > 0. The assumptions 
on g and h would be exactly analogous with g € Cp 2 (M. d ) and h € 

4. Applications 

In this section, we apply the theorems of Section 3.2 to obtain distributional 
bounds for statistics that are asymptotically chi-square, variance-gamma 
and chi distributed. We also consider an application to the delta method. 
The applications are chosen to illuminate the results of Section 3. 

4.1. A chi-square approximation theorem 

Our first application is a bound for the rate of convergence of an asymptot¬ 
ically chi-square distributed statistic in Wasserstein distance (set ||h/|| = 1 
in the following corollary). To date, a bound for the rate of con¬ 

vergence in Wasserstein distance has not been derived from the chi-square 
Stein equation. 

Corollary 4.1. Let X, Xik, X 2 k, ■ ■ ■ ,X n k, 1 < k < d, be i.i.d. random vari¬ 
ables with EX = 0, EX 2 = 1 and E|X| 5 < 00 . Define Wk = Y^i=i Xik 
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and let S d = Ylk=i Then, for absolutely continuous h, 


WSi) - X 2 W h\ < 


2_, E|X | 5 


E|X | 3 + W-EX 4 + 

' 7r x /n 


where xf d )h denotes the expectation of h(U) for U ~ xf d )- 

Proof. The bound for d = 1 follows from applying Theorem 3.3 with g(w) = 
w 2 . Here g'(w) = 2w, so we take P(w) = 2w as our dominating function, 
and applying the theorem with p = 2, A = 0, B = 2 and r = 1 proves the 
result for the case d = 1. In obtaining the bound, we used that E|W| < 
(IEIT 2 ) 1 / 2 = 1 and that E|Z | 3 = 2y/2 /V. 

We now use this bound to obtain a bound for general d. Define Uq = 

EU W h V* = E?=i 2? ~ xf d) and U k = El, z? + Wf for 

k = 1 , ,d — 1. With this notation, we have 


ms d )- x 2 d] h\ = 


'*T,Eh(U k )-Eh(U k - 1 ) 


k =1 
d 


< ^E|E [h(U k ) - h(u k - 1) \z u ..., W fe+ 1, ...,W d ]|. 

k=1 

(4.1) 


Since || g^\x + c)|| = || 5 ^(a:)|| for any constant c, we may use our bound 
for the case d = 1 to bound the inner expectation of (4.1). Therefore, as the 
Xij are identically distributed, we can bound E h{S d ) — xf^h by our bound 
for Eh(S\) — X?i)h multiplied by d. □ 

Due to the symmetry of the statistic S d , we can obtain faster convergence 
rates for smooth test functions: 

Corollary 4.2. Let X, X\k, X 2 k , ■ ■ ■ ,X nk , 1 < k < d, be i.i.d. random vari¬ 
ables with EX = 0, EX 2 = 1 and EX 8 < 00 . Then, for h € C 2 (M + ) ; 


\m(S d )-x 2 {d) h\<-{\\h"\\ + \\h'\\) 


43 

22EIW1I 3 + 40EIXI 5 + —Elxr 

n 

/ 2592 

+ |EX 3 | 1312EX 4 EW 1 4 + 3974EX 6 +-EX 8 




n 


Proof. The bound for d = 1 follows from applying Theorem 3.5 with g(w) = 
w 2 . We take P(w) = 2+4u ; 2 as our dominating function, since \g'(w)\ 2 = Aiv 2 
and g"{w) = 2 are both bounded by P(w) for all w € M. Applying the 
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theorem with p = 2, A = 2, B = 4 and r = 2 proves the result for the case 
d = 1. The extension to d > 2 follows from the same argument as used in the 
proof of Corollary 4.1. To obtain a more compact form for the final bound, 
we simplify by using that 1 < E|X| a < E|X | 6 and 1 < E|Wi| a < E|lTi| 6 , for 
2 < a < b, which follows from Holder’s inequality. We also round numbers 
up to the nearest integer. □ 

Remark 4.1. We could also have used Theorem 3.4 to obtain a 0(n~ l ) 
bound. However, we would require that h G C^(M+) and the dependence on 
d would be worse: the resulting bound would be 0(d 3 n~ 1 ). 

A bound for the quantity E h.(Sd) — xfd) h as a ^ so been obtained by [ 11 ], 
through a different approach involving the chi-square Stein equation. They 
also required that EX 8 < oo; however, unlike ours, their bound does not 
grow with d, although they do require that h G 

4-2. Variance-gamma approximation 

Here we consider an approximation theorem for the statistic T,f = Ylt=i WkWd+k, 
which convergences to the random variable Vd = Ylt=i ZkZd+k- This ran¬ 
dom variable has a variance-gamma distribution (see [7]) with density 

d—l 

1 f\x\\^ r 

p(x ) =-r- — ) Kd -i (|x|), x G R, 

V ; 0 Fr(f) v 2 ) — Vl u ' 

where K u (x) = f^° e ~ xcos H t ) cosh(i/t) dt is a modified Bessel function of 
the second kind (see [23]). Unlike the chi-square application, we cannot use 
Theorem 3.3 to obtain a Wasserstein distance bound, but the symmetry of 
the statistic Td allows us to obtain a bound on the rate of convergence of 
order n ^ 1 for smooth test functions. 

Corollary 4.3. Suppose that X, ..., X rn ^\,..., ..., X m ^ and that 

Y, Y\ \,..., Y U} i ,..., Y\ d-, ■ ■ ■ i Y n d are independent random variables with zero 

V V 

mean, unit variance and bounded eighth moment, with X,,^ = X and Yjj. = 
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Y for all i, j and k. Let T d = x i,k Y j,k- Th ^n, for h <E C£ 


\Eh(T d ) - Eh(V d )\ < d(\\hW\\ + 6||/i( 3 ) || + 7\\h"\\ + H/i'll) 


43 
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ex 4 ert+- 


EX 8 \ 43 


m 2 
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d 4 ef 8 \ 

Er 4 Ew| +1 +^2-J 
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/|E^ + J EpW1781 E 
V V' m yjn J \ y/m 


1781. 


1004. 


1004. 


+ EyW|+i + _ EX » + _ 


El 


Proof. We prove the result for case d = 1 by using Theorem 3.4 with 
g(u, v ) = uv. Now, d u g = v, d v g = u, d uv g = 1 and all other partial deriva¬ 
tives are equal to zero. We can therefore take P(u,v) = 1 + u 4 + u 4 as our 
dominating function. On applying Theorem 3.4 with d = 2, p = 2, A = 1, 
B = 1 and r\ = r 2 = 4 we obtain a bound for the quantity E/i(Ti) — E/i(Vi). 
The extension to general d follows from an analogous calculation to the one 
used in the proof of Corollary 4.1. We simplify the final bound similarly to 
as was done in the proof of Corollary 4.2. □ 

Remark 4.2. In the chi-square example, we obtained a bound involving 
only the first two derivatives of h by first considering the case d = 1 (here 
g(w) = w 2 is even and a function of one variable, so we can apply Theorem 
3.5). However, we cannot use such an approach in the variance-gamma exam¬ 
ple, because we cannot decompose the function Ylt=i w k w d+k into functions 
of one variable that are also even. 

As was the case in the chi-square example, our bound performs worse for 
large d than a 0(d 1 ^ 2 (m~ 1 + n -1 )) bound for the quantity E h(T d ) — E h{V d ) 
that was obtained by [7], which also only requires that h G C 8 (]R). 


4-3. Product normal approximation 

Consider the statistic Ofc=i Wfcj which converges to the product normal 
random variable n/. = i ^k- F° r odd d, we can obtain a (^(n" 1 / 2 ) bound, for 
smooth test functions, by applying Theorem 3.2 with g( w) = nf=i w k, or by 
applying Theorem 3.3 with g(w) = w and then extending to the general case 
d > 1 by using an argument similar to that used in the proof of Corollary 
4.1. We can similarly obtain a 0(n -1 ) rate when d is even, because here g 
is an even function. 

Recently, [9] obtained a Stein equation for the product of d independent 
standard normal random variables, which is a d-th order linear differential 
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operator. Consequently, obtaining estimates for the solution for general d 
is difficult, and to date estimates are only available for d < 2. Therefore, 
applications in that paper are limited to d < 2. This example demonstrates 
the power of our approach, as, through the function g , we have essentially 
reduced our problem to that of multivariate normal approximation, for which 
there exists a more tractable Stein equation. 

4-4- Approximation theorems for the chi distribution 

If Z ~ 1V(0, 1), then \ Z\ has a chi distribution with density \J2 / ire~ x2 / 2 , x > 
0. Since |x| is continuous, it follows that \ W\ \Z\, where W = Y^i=i Xi 
as usual. Now |x| is an even function, but is not twice differentiable, so we 
cannot apply Theorem 3.5 to obtain a 0(n~ l ) bound on the convergence. 
But |x| is absolutely continuous and its derivative is bounded almost every¬ 
where, and we can therefore apply Theorem 3.3 to obtain a bound of order 
n -1 / 2 . 

More generally, the random variable gi/Zi) = (Zf + • • • + Z^) 1 / 2 has a 
chi distribution with d degrees of freedom. Again, the second order partial 
derivatives of g do not exist at the origin. This causes a slight problem 
when d > 2, because none of the bounds of Section 3 are directly applicable 
(Theorem 3.2 requires the existence of the second order partial derivatives 
of g ). Nevertheless, in the proof of the following corollary we see a simple 
way of getting around this difficulty. 

Corollary 4.4. Let the statistic Sd be defined as in Corollary 4-%- Then, 
for all absolutely continuous h, 

\mVsj - X(d)h\ < ^P(2 + E|A| 3 ), (4.2) 

y/n 

where X{d)h denotes the expectation of h(Y) for Y ~ X(d)- 

Proof. Let us first consider the case d = 1. A well-known bound of [29] for 
normal approximation is that \¥.h{W) — ~Kh{Z)\ < ^J-(2+E|X| 3 ). Replacing 
h(x) by /i(|x|) in this inequality gives (4.2) for the case d = 1. We now note 
that, for any a € R, 

\Eh{Vw 2 + a 2 ) - E h{Vz 2 + a 2 )| < W-(2 + E|X| 3 ), (4.3) 

\Jn 

which holds because | ^h(V x 2 + a 2 ) | = | -j £ ^ hfs/x 2 + a 2 )| < ||/i'||. Then, 
with the random variables U^, k = 0,1 ,..., d — 1, defined as in the proof of 
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Corollary 4.2, we have 

I Eh(y/Sd) - X(d)h\ 
d 

< J2 \z u ..., z k _ 1: w k+ 1 , ...,w d ] |, (4.4) 

k =1 

and bounding the inner expectations of (4.4) using (4.3) yields (4.2). □ 

4-5. The delta method 

Let X \,..., X n be independent random variables and let X denote the sam¬ 
ple mean. Then, by the delta method, 

V^f(X) 4 N (0, [/'('0)] 2 ) (4.5) 

for continuous functions / with /'(0) ^ 0. We can get a bound for the rate 
of convergence for the case that the derivatives of / have polynomial growth 
by applying Theorem 3.3 with g(w) = \/nf{w/y/n) (note that W = \fnX 
and g'{ 0) = /'(0)). The condition /'(0) / 0 does not hold for even /, and 
so Theorem 3.5 is not applicable. 

The limiting result (4.5) naturally generalises to functions / : R n —>• R m . 
If the partial derivatives of / are of polynomial growth, we can use Theorem 
3.2 to obtain bounds for m = 1, but none of our bounds are applicable for 
the more general case. 

4-6. Approximation of moments and related applications 

One can readily use the approach of this paper to obtain approximations 
for the moments, absolute moments, moment generating functions and char¬ 
acteristic functions of sums of independent random variables. For example, 
taking h(w) = w and g(u>) = |u;| r in Theorems 3.2 (if r > 1) and 3.3 
(if r > 2, in which case the symmetry leads to faster convergence rates) 
yields estimates for absolute moments. Estimates for moment generating 
functions of sums of independent random variables could be obtained by 
taking h(w) = w and g(w) = e tw in an analogue of Theorem 3.2 with ex¬ 
ponential growth rate. We do not, however, pursue this any further in this 
paper. 
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5. Discussion 

In this paper, we have introduced a general method for assessing the dis¬ 
tributional distance between the random variable g( W) and its limiting 
distribution ^(X 1 / 2 Z). To simplify the exposition, we mostly restricted our 
focus to the case of partial derivatives of g with polynomial growth and 
Wj = Yli =1 ^ij , where the X %3 are independent. 

This restriction allowed for a comprehensive account of the rate of con¬ 
vergence g( W). We showed that, for smooth test functions, the rate of con¬ 
vergence is faster than 0 (n -1 / 2 ) if various lower order moments of the Xij 
agree with those of the standard normal, or if the function g is even and 
satisfies appropriate differentiability and boundedness conditions. We also 
saw that the assumptions on g can be weakened if d = 1 . 

We would expect these principles regarding convergence rates to apply in 
many other settings. As was discussed in Remark 3.6, one could readily ob¬ 
tain analogues of Theorems 3.2-3.5 for functions g with partial derivatives 
of exponential growth by applying the bounds of Section 2.3. The same 
principles regarding convergence rates would apply, with the only change 
being that the condition of existence of various absolute moments of the 
Xjj would be replaced by an assumption that expectations of the form 
E[|Xy|“ exp(/3|Ajj| 7 )] exist, for some a,/3 ,7 > 0 . 

Also, because through the function g , we essentially reduce the problem 
to that of multivariate normal approximation via the multivariate normal 
Stein equation, our method can be applied when there is a dependence 
amongst the Xij. Here the dependence structure can be disentangled in the 
usual manner via the many coupling techniques developed for multivariate 
normal approximation by Stein’s method. Again, we expect that in many 
situations the convergence principles established in the independent case will 
carry through. 

To illustrate this point, [11] recently obtained a 0(n _1 ) bound (here n is 
the sample size) for the distributional distance between Pearson’s statistic 
and its limiting chi-square distribution, for smooth test functions. Let X^ 
denote the indicator random variable that the i -th trial falls in the j-th class 
[m classes in total) that has been standardised to have mean zero and unit 
variance. Then Pearson’s statistic can be written in the form g( W), where 
Wj = Y^= 1 Xij and g( w) = Y^k=l w \• Here, g is even with derivatives 

of polynomial growth, and even though there is a dependence amongst the 
X ^, a convergence rate of 0 (n _1 ) still holds. 

Finally, we note that our approach can in principle be adapted to assess 
the rate of convergence of statistics that are asymptotically distributed as 
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g(Y), where Y need not be multivariate normally distributed. Provided a 
relatively tractable Stein equation is available for the distribution of Y, our 
approach should in principle work, and we may expect similar principles 
regarding convergence rates to apply. For instance, if the p.d.f. of Y is sym¬ 
metric about the origin (as is so for the normal p.d.f.), we may expect to 
see faster convergence rates if g is an even function. 
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